SEQUENCE LISTING 



<110> Black Jr., Charles A. 



<120> COMPOSITIONS AND METHODS FOR ACTIVATING 
GENES OF INTEREST 



<130> 5722-2(35722/191928) 



<140> 09/446,402 
<141> 1999-12-20 



<150> PCT/US98/13093 
<151> 1998-06-24 



<150> 60/050,772 
<151> 1997-06-25 



<160> 19 
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<210> 1 
<211> 4279 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Recombinant Molecule containing multiple cloning 
site^ kozak sequence^ LacZ gene. 



<221> misc_feature 

<222> (1) . . . (64) 

<223> Multiple cloning site 



<221> misc_feature 
<222> (65) • . . (79) 

<223> Consensus sequence for the "Kozak sequence" 
(translation initiation) 



<221> prim_transcript 
<222> (80) . . . (4279) 
<223> Beta galactosidase 



<400> 1 

ttaatacgac 

ccgggccgcc 

tggttaggta 

ggaaagtccc 

gcaaccaggt 

ctcaattagt 

cccagttccg 

gaggccgcct 

ggcttttgca 



tcactatagg 
gccaccatgg 
ccttctgagg 
caggctcccc 
gtggaaagtc 
cagcaaccat 
cccattctcc 
cggcctctga 
aaaagcttgg 



ctagcctcga 
cgcagcacca 
cggaaagaac 
agcaggcaga 
cccaggctcc 
agtcccgccc 
gccccatggc 
gctattccag 
gatctctata 



gaattcacgc 
tggcctgaaa 
cagctgtgga 
agtatgcaaa 
ccagcaggca 
ctaactccgc 
tgactaattt 
aagtagtgag 
atctcgcgca 



gtggtacctc 
taacctctga 
atgtgtgtca 
gcatgcatct 
gaagtatgca 
ccatcccgcc 
tttttattta 
gaggcttttt 
acctattttc 



tagagtcgac 
aagaggaact 
gttagggtgt 
caattagtca 
aagcatgcat 
cctaactccg 
tgcagaggcc 
tggaggccta 
ccctcgaaca 



60 
120 
180 
240 
300 
360 
420 
480 
540 
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ctttttaagc cgtagataaa caggctggga cacttcacat gagcgaaaaa tacatcgtca 600 

cctgggacat gttgcagatc catgcacgta aactcgcaag ccgactgatg ccttctgaac 660 

aatggaaagg cattattgcc gtaagccgtg gcggtctggt accggtgggt gaagaccaga 720 

aacagcacct cgaactgagc cgcgatattg cccagcgttt caacgcgctg tatggcgaga 780 

tcgatcccgt cgttttacaa cgtcgtgact gggaaaaccc tggcgttacc caacttaatc 840 

gccttgcagc acatccccct ttcgccagct ggcgtaatag cgaagaggcc cgcaccgatc 900 

gcccttccca acagttgcgc agcctgaatg gcgaatggcg ctttgcctgg tttccggcac 960 

cagaagcggt gccggaaagc tggctggagt gcgatcttcc tgaggccgat actgtcgtcg 1020 

tcccctcaaa ctggcagatg cacggttacg atgcgcccat ctacaccaac gtaacctatc 1080 

ccattacggt caatccgccg tttgttccca cggagaatcc gacgggttgt tactcgctca 1140 

catttaatgt tgatgaaagc tggctacagg aaggccagac gcgaattatt tttgatggcg 1200 

ttaactcggc gtttcatctg tggtgcaacg ggcgctgggt cggttacggc caggacagtc 1260 

gtttgccgtc tgaatttgac ctgagcgcat ttttacgcgc cggagaaaac cgcctcgcgg 1320 

tgatggtgct gcgttggagt gacggcagtt atctggaaga tcaggatatg tggcggatga 1380 

gcggcatttt ccgtgacgtc tcgttgctgc ataaaccgac tacacaaatc agcgatttcc 1440 

atgttgccac tcgctttaat gatgatttca gccgcgctgt actggaggct gaagttcaga 1500 

tgtgcggcga gttgcgtgac tacctacggg taacagtttc tttatggcag ggtgaaacgc 1560 

aggtcgccag cggcaccgcg cctttcggcg gtgaaattat cgatgagcgt ggtggttatg 1620 

ccgatcgcgt cacactacgt ctgaacgtcg aaaacccgaa actgtggagc gccgaaatcc 1680 

cgaatctcta tcgtgcggtg gttgaactgc acaccgccga cggcacgctg attgaagcag 1740 

aagcctgcga tgtcggtttc cgcgaggtgc ggattgaaaa tggtctgctg ctgctgaacg 1800 

Ip gcaagccgtt gctgattcga ggcgttaacc gtcacgagca tcatcctctg catggtcagg 1860 

tcatggatga gcagacgatg gtgcaggata tcctgctgat gaagcagaac aactttaacg 1920 

J/ccgtgcgctg ttcgcattat ccgaaccatc cgctgtggta cacgctgtgc gaccgctacg 1980 

1^' gcctgtatgt ggtggatgaa gccaatattg aaacccacgg catggtgcca atgaatcgtc 2040 

tgaccgatga tccgcgctgg ctaccggcga tgagcgaacg cgtaacgcga atggtgcagc 2100 

"^1' gcgatcgtaa tcacccgagt gtgatcatct ggtcgctggg gaatgaatca ggccacggcg 2160 

ctaatcacga cgcgctgtat cgctggatca aatctgtcga tccttcccgc ccggtgcagt 2220 

atgaaggcgg cggagccgac accacggcca ccgatattat ttgcccgatg tacgcgcgcg 2280 

Ifl tggatgaaga ccagcccttc ccggctgtgc cgaaatggtc catcaaaaaa tggctttcgc 2340 

tacctggaga gacgcgcccg ctgatccttt gcgaatacgc ccacgcgatg ggtaacagtc 2400 

12 ttggcggttt cgctaaatac tggcaggcgt ttcgtcagta tccccgttta cagggcggct 24 60 

tcgtctggga ctgggtggat cagtcgctga ttaaatatga tgaaaacggc aacccgtggt 2520 

1^1 cggcttacgg cggtgatttt ggcgatacgc cgaacgatcg ccagttctgt atgaacggtc 2580 

i5 "'^ggtctttgc cgaccgcacg ccgcatccag cgctgacgga agcaaaacac cagcagcagt 2640 

JJttttccagtt ccgtttatcc gggcaaacca tcgaagtgac cagcgaatac ctgttccgtc 2700 

Matagcgataa cgagctcctg cactggatgg tggcgctgga tggtaagccg ctggcaagcg 2760 

I*'" gtgaagtgcc tctggatgtc gctccacaag gtaaacagtt gattgaactg cctgaactac 2820 

cgcagccgga gagcgccggg caactctggc tcacagtacg cgtagtgcaa ccgaacgcga 2880 

ccgcatggtc agaagccggg cacatcagcg cctggcagca gtggcgtctg gcggaaaacc 2940 

tcagtgtgac gctccccgcc gcgtcccacg ccatcccgca tctgaccacc agcgaaatgg 3000 

atttttgcat cgagctgggt aataagcgtt ggcaatttaa ccgccagtca ggctttcttt 3060 

cacagatgtg gattggcgat aaaaaacaac tgctgacgcc gctgcgcgat cagttcaccc 3120 

gtgcaccgct ggataacgac attggcgtaa gtgaagcgac ccgcattgac cctaacgcct 3180 

gggtcgaacg ctggaaggcg gcgggccatt accaggccga agcagcgttg ttgcagtgca 3240 

cggcagatac acttgctgat gcggtgctga ttacgaccgc tcacgcgtgg cagcatcagg 3300 

ggaaaacctt atttatcagc cggaaaacct accggattga tggtagtggt caaatggcga 3360 

ttaccgttga tgttgaagtg gcgagcgata caccgcatcc ggcgcggatt ggcctgaact 3420 

gccagctggc gcaggtagca gagcgggtaa actggctcgg attagggccg caagaaaact 3480 

atcccgaccg ccttactgcc gcctgttttg accgctggga tctgccattg tcagacatgt 3540 

ataccccgta cgtcttcccg agcgaaaacg gtctgcgctg cgggacgcgc gaattgaatt 3600 

atggcccaca ccagtggcgc ggcgacttcc agttcaacat cagccgctac agtcaacagc 3660 

aactgatgga aaccagccat cgccatctgc tgcacgcgga agaaggcaca tggctgaata 3720 

tcgacggttt ccatatgggg attggtggcg acgactcctg gagcccgtca gtatcggcgg 3780 

aattccagct gagcgccggt cgctaccatt accagttggt ctggtgtcaa aaataataat 3840 
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aaccgggcag gccatgtctg cccgtatttc gcgtaaggaa atccattatg tactatttaa 3900 

aaaacacaaa cttttggatg ttcggtttat tctttttctt ttactttttt atcatgggag 3960 

cctacttccc gtttttcccg atttggctac atgacatcaa ccatatcagc aaaagtgata 4020 

cgggtattat ttttgccgct atttctctgt tctcgctatt attccaaccg ctgtttggtc 4080 

tgctttctga caaactcgga acttgtttat tgcagcttat aatggttaca aataaagcaa 4140 

tagcatcaca aatttcacaa ataaagcatt tttttcactg cattctagtt gtggtttgtc 4200 

caaactcatc aatgtatctt atcatgtctg gatcctctag agtcgacctg caggcatgca 4260 

agctggcact ggccgtcgt 4279 

<210> 2 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Synthetic oligonucleotide 
<400> 2 

gaatacaaag cttatgcatg 20 

<210> 3 
|;^<211> 13 
;'|^<212> DNA 

'S<213> Artificial Sequence 
^]<220> 

y^<223> Synthetic oligonucleotide 
^^p<4 00> 3 

|g gaatacaaag ctt 13 

%,<210> 4 
"^<211> 20 
Jft<212> DNA 

-^^<213> Artificial Sequence 
O<220> 

|s^<223> Synthetic oligonucleotide 
<400> 4 

aaagcttatg catgcggccg 20 

<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 5 

cggccgcatc tagagggccc 20 



<210> 6 
<211> 25 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 6 

gcggccgcat ctagagggcc cggat 

<210> 7 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 7 

aatacaaagc ttatgcatgc ggcc 

<210> 8 
<211> 30 
^ <212> DNA 

' <213> Artificial Sequence 



I <220> 

<223> Synthetic oligonucleotide 
J <400> 8 

, aatacaaagc ttatgcatgc ggccgcatct 

, <210> 9 
: <211> 20 
■ <212> DNA 

"^<213> Artificial Sequence 
U220> 

:<223> Synthetic oligonucleotide 
<400> 9 

catgcataag ctttgtattc 

<210> 10 

<211> 13 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 10 
aagctttgta ttc 

<210> 11 
<211> 20 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 11 

cggccgcatg cataagcttt 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 12 

gggccctcta gatgcggccg 

<210> 13 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 13 

atccgggccc tctagatgcg gccgc 

<210> 14 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 14 

ggccgcatgc ataagctttg tatt 

<210> 15 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 15 

agatgcggcc gcatgcataa gctttgtatt 

<210> 16 
<211> 1798 



<212> RNA 
<213> Unknown 



<220> 

<223> mRNA sequence for Firefly luciferase 
<400> 16 

gaauacaaag cuuaugcaug cggccgcauc uagagggccc ggauccaaau ggaagacgcc 60 

aaaaacauaa agaaaggccc ggcgccauuc uauccucuag aggauggaac cgcuggagag 120 

caacugcaua aggcuaugaa gagauacgcc cugguuccug gaacaauugc uuuuacagau 180 

gcacauaucg aggugaacau cacguacgcg gaauacuucg aaauguccgu ucgguuggca 24 0 

gaagcuauga aacgauaugg gcugaauaca aaucacagaa ucgucguaug cagugaaaac 300 

ucucuucaau ucuuuaugcc gguguugggc gccguuauuu aucggaguug caguugcgcc 360 

cgcgaagcac auuuauaaug aacgugaauu gcucaacagu augaacauuu cgcagccuac 420 

cguaguguuu guuuccaaaa agggguugca aaaaauuuug aacgugcaaa aaaaauuacc 4 80 

aauaauccag aaaauuauua ucauggauuc uaaaacggau uaccagggau uucagucgau 540 

guacacguuc gucacaucuc aucuaccucc cgguuuuaau gaauacgauu uuguaccaga 600 

guccuuugau cgugacaaaa caauugcacu gauaaugaau uccucuggau cuacuggguu 660 

accuaagggu guggcccuuc cgcauagaac ugccugcguc agauucucgc augccagaga 720 

uccuauuuuu ggcaaucaaa ucauuccgga uacugcgauu uuaaguguug uuccauucca 780 

ucacgguuuu ggaauguuua cuacacucgg auauuugaua uguggauuuc gagucgucuu 840 

Q aauguauaga uuugaagaag agcuguuuuu acgaucccuu caggauuaca aaauucaaag 900 

^Ji ugcguugcua guaccaaccc uauuuucauu cuucgccaaa agcacucuga uugacaaaua 960 

^ cgauuuaucu aauuuacacg aaauugcuuc ugggggcgca ccucuuucga aagaagucgg 1020 

Sii ggaagcgguu gcaaaacgcu uccaucuucc agggauacga caaggauaug ggcucacuga 1080 

gacuacauca gcuauucuga uuacacccga gggggaugau aaaccgggcg cggucgguaa 114 0 

^2 aguuguucca uuuuuugaag cgaagguugu ggaucuggau accgggaaaa cgcugggcgu 1200 

CJ uaaucagaga ggcgaauuau gugucagagg accuaugauu auguccgguu auguaaacaa 1260 

uccggaagcg accaacgccu ugauugacaa ggauggaugg cuacauucug gagacauagc 1320 

ijl uuacugggac gaagacgaac acuucuucau aguugaccgc uugaagucuu uaauuaaaua 1380 

^. caaaggauau cagguggccc ccgcugaauu ggaaucgaua uuguuacaac accccaacau 14 40 

cuucgacgcg ggcguggcag gucuucccga cgaugacgcc ggugaacuuc ccgccgccgu 1500 

uguuguuuug gagcacggaa agacgaugac ggaaaaagag aucguggauu acgucgccag 1560 

ucaaguaaca accgcgaaaa aguugcgcgg aggaguugug uuuguggacg aaguaccgaa 1620 

12 ^99^cuuacc ggaaaacucg acgcaagaaa aaucagagag auccucauaa aggccaagaa 1680 

gggcggaaag uccaaauugu aaaauguaac uguauucagc gaugacgaaa uucuuagcua 1740 

Muuguaauccu ccgagggggc gagcucccaa aaaaaaaaaa aaaaaaaaaa aaaaaaaa 1798 

<210> 17 
<211> 12 
<212> DNA 

<213> Consensus Kozack sequence 
<400> 17 

gccgccgcca tg 12 

<210> 18 
<211> 13 
<212> DNA 

<213> Consensus Kozack sequence 
<400> 18 

gccgccrcca ugg 13 
<210> 19 
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<211> 8 
<212> RNA 

<213> Consensus Shine-Dalgarno sequence 

<400> 19 
uaaggagg 
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